(57) Abstract

A method for recognizing a spoken audio signal as input 
data to a computer includes receiving the spoken audio signal 
at an input device of the computer (step 1A), receiving a typed
first letter of the spoken audio signal at the input device of the
computer (step 1B), and searching through entries that begin
with the typed first letter in a dictionary in the computer for
a best match with the spoken audio signal (step 2). 
SPEECH RECOGNITION COMPUTER INPUT AND DEVICE

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to a
computer interface method and device wherein a
user's voice is used to input data to a computer.
More particularly, the present invention pertains to
a method and apparatus wherein a user's voice, in
conjunction with a keyboard, is used for inputting
instructions or data to a computer. Most
particularly, the present invention relates to a
voice recognition computer input method and
apparatus wherein a user inputs data into a computer
by speaking each word while typing, via a keyboard,
the first letter of each spoken word to assist the
computer in identifying or recognizing the spoken
words.

2. Description of the Related Art

There currently exist various inexpensive
software programs for use with personal computers
that allow voice input of text and instructions.
Some such programs are offered by Dragon Systems,
IBM and Kurzweil. These programs allow a user's
speech to be input to a computer via, for example,
an audio input device, whereupon the spoken words
are identified and used for various applications
such as for text input or to execute commands. As
can be appreciated, such man-machine dictation
programs save users time by alleviating the need to
enter data via a keyboard to generate documents.

With reference to FIG. 1, a diagram of the
steps used in a prior art program for effectuating
speech input to a computer is depicted. As shown,
the prior art program is divided into five separate
steps. The first step involves the user speaking in
proximity to a computer so that a sound detector can
receive the spoken audio signal. The computer then
determines when each word begins. For each spoken
word, a dictionary search is conducted to locate
potential matches between stored dictionary words
and each spoken word. The best dictionary word
match is then selected from the potential matches
and the selected word is forwarded to an
application.

Unfortunately, such prior art programs 
suffer several drawbacks. Specifically, the error 
rate for such programs remains high. Moreover, 
problems exist with continuous speech input, thus
requiring an awkward style of spoken input by 
pausing at intervals to allow the program to 
identify the spoken words and process the 
information. Also, such prior art programs must be 
trained prior to use. In other words, prior to use 
a user must take time to record a large sample 
vocabulary for the machine to compare with inputs. 

As a result of the drawbacks of prior art 
programs, users are left with the situation of 
continuing to rely upon old methods for accurate,
convenient text input, which involve either someone 
to take dictation and then enter the spoken words 
into a computer via a keyboard, or fast typing 
skills. Moreover, the continuous and frequent use 
of a keyboard results in a risk of repetitive stress 
injuries and fatigue. 
BRIEF DESCRIPTION OF THE DRAWINGS

In the drawings: 

FIG. 1 is a block diagram depicting the 
steps of a prior art voice input method; 

FIG. 2 is a block diagram depicting the
steps of an embodiment of the voice recognition
computer input method and device in accordance with
the present invention; and

FIG. 3 is a block diagram depicting the
steps of another embodiment of the voice recognition
computer input method and device in accordance with
the present invention.

DETAILED DESCRIPTION OF THE PRESENTLY PREFERRED
EMBODIMENTS

The present invention utilizes known voice
recognition technology to allow a user to type the
initial letter of each spoken word, via a keyboard,
substantially simultaneously with the spoken word,
thereby reducing the processing power and time
required by a computer. The invention overcomes the
drawbacks of the prior art in several respects.
Initially, continuous speech becomes possible
because the computer is advised exactly when a new
word begins, by keyboard entry of the first letter
of each word, thus vastly reducing the processing
power which was required in prior art systems to
determine when one word ended and another began. In
addition, the possible choice of words is vastly
reduced for the computer since the initial letter is
known via keyboard input. This feature further
reduces the required processing power and time
because fewer words from the dictionary need to be
located and retrieved. This feature also increases
the accuracy of the selected or retrieved word
because the first letter of the spoken word is
provided to the computer.
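
By way of illustration only, the reduction in candidate words can be sketched as follows. The toy word list and the helper name build_index are assumptions made for this sketch and do not appear in the drawings; a practical dictionary would hold many thousands of entries.

```python
from collections import defaultdict

def build_index(words):
    """Group dictionary entries by their (lower-cased) first letter."""
    index = defaultdict(list)
    for word in words:
        index[word[0].lower()].append(word)
    return index

# Hypothetical toy dictionary, for illustration only.
DICTIONARY = ["bat", "cat", "cap", "dog", "hat", "mat", "delete", "data"]
index = build_index(DICTIONARY)

# Typing "c" limits the search to entries beginning with "c".
candidates = index["c"]
print(len(DICTIONARY), "entries reduced to", len(candidates), candidates)
```

Only the entries filed under the typed letter need to be compared with the spoken audio signal, which is the source of the savings described above.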

In addition to the foregoing benefits, the
present invention requires significantly less typing 
than standard keyboard entry. As a result, smaller 
keyboards relative to standard sized keyboards may 
be used, thereby increasing user workstation surface 
area that was previously occupied by a standard size 
keyboard. Moreover, the present invention is 
particularly well-suited for use with palm-sized 
computers or Personal Digital Assistants (PDA's). 
Furthermore, as less typing is required for data 
input, there is a reduced chance that injury, 
fatigue and error will occur. 

As a result of the present invention, 
speaker independent programs can become feasible 
much more easily since the reliance on the
computer's ability to discriminate variations in
pronunciation is considerably diminished. 

The present invention is depicted in Fig.
2. As shown, steps 1A and 1B are performed
substantially simultaneously with each other. In
the alternative, a buffer register may be employed
to temporarily store either the spoken word or the
typed letter while awaiting input of the other of
the corresponding spoken word or input letter. Once
both a spoken word and a letter are input to the
computer, a search is conducted among dictionary
entries, starting with the typed letter, to locate
potential matches. The best match is then selected
and the selected word is forwarded to an
application.
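
The flow just described may be sketched as follows, under stated assumptions. The class InputBuffer, the function best_match and the toy dictionary are hypothetical names introduced for this sketch. The similarity function is only a stand-in: it compares a rough text rendering of the utterance with each entry, whereas an actual system would compare the audio signal itself using known voice recognition technology.

```python
from difflib import SequenceMatcher

# Hypothetical toy dictionary, for illustration only.
DICTIONARY = ["cat", "cap", "hat", "data", "delete", "dog"]

def similarity(spoken, entry):
    # Stand-in for acoustic matching: compares strings, not audio.
    return SequenceMatcher(None, spoken, entry).ratio()

def best_match(spoken, letter, dictionary=DICTIONARY):
    """Search only entries starting with the typed letter; return the best one."""
    candidates = [w for w in dictionary if w.startswith(letter)]
    if not candidates:
        return None
    return max(candidates, key=lambda w: similarity(spoken, w))

class InputBuffer:
    """Holds whichever input arrives first until its counterpart arrives."""

    def __init__(self):
        self.spoken = None
        self.letter = None

    def on_speech(self, spoken):
        self.spoken = spoken
        return self._try_match()

    def on_key(self, letter):
        self.letter = letter.lower()
        return self._try_match()

    def _try_match(self):
        # Wait until both the spoken word and the typed letter are present.
        if self.spoken is None or self.letter is None:
            return None
        word = best_match(self.spoken, self.letter)
        self.spoken = self.letter = None  # ready for the next word
        return word

buf = InputBuffer()
print(buf.on_speech("kat"))  # nothing to search yet: the letter is missing
print(buf.on_key("c"))       # both inputs present: the search is run
```

Either input may arrive first; the search runs only once both are available, and the buffer is then cleared for the next word.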

In addition to the above, Fig. 3 shows
that the present invention may assign or designate
single or multiple words to specific functions which
are performed when the designated word is spoken
without the first letter of the word being typed as
in steps 5 and 6 of Fig. 3. As an illustration, the
word "delete" may be assigned the function of
erasing the last typed word when the word "delete"
is spoken without typing the letter "d". Thus, if
the computer incorrectly identifies a word based on
the spoken word and typed-in first letter of the
spoken word, a user can strike or erase the
incorrect word by saying the word "delete". Of
course, if the word "delete" is intended to be
input, the word will be spoken and the letter "d"
will be pressed on the keyboard. Such a function
can be performed on any incorrect word by simply
moving the cursor to the incorrect word and saying
"delete". Step 5 comprises the step of comparing
the spoken audio signal to a list of predetermined
functions. Step 5 may optionally comprise the steps
of determining whether the user types a first letter
and then comparing the spoken audio signal to the
list of predetermined functions.
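
A minimal sketch of this function-word handling is given below. The names process, FUNCTIONS and erase_last_word are hypothetical, and the recognize callback merely stands in for the dictionary search. Ignoring an utterance that has no typed letter and is not a function word is an assumption of the sketch; the description does not address that case.

```python
def erase_last_word(words):
    """Function assigned to "delete": drop the most recently entered word."""
    return words[:-1]

# Hypothetical list of predetermined functions, keyed by the designated word.
FUNCTIONS = {"delete": erase_last_word}

def process(words, spoken, typed_letter=None,
            recognize=lambda spoken, letter: spoken):
    """Return the updated word list after one spoken input.

    `recognize` stands in for the dictionary search restricted to the
    typed letter; by default it simply echoes the spoken word.
    """
    if typed_letter is None:
        if spoken in FUNCTIONS:
            # Spoken without its first letter typed: perform the function.
            return FUNCTIONS[spoken](words)
        # Assumption: anything else without a typed letter is ignored.
        return words
    # A letter was typed, so the spoken word is ordinary text input.
    return words + [recognize(spoken, typed_letter)]

text = ["press", "the", "hat"]
text = process(text, "delete")        # erases the misrecognized "hat"
text = process(text, "delete", "d")   # inserts the word "delete" itself
print(text)
```

The presence or absence of the typed letter is what distinguishes the command "delete" from the dictated word "delete".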

The invention is not limited by the 
embodiments described above which are presented as 
examples only but can be modified in various ways as 
will be readily recognized by those having ordinary 
skill in the art. For example, instead of typing 
only the first letter of each spoken word, the first 
and last letters can be typed, or, alternatively, 
the first two letters can be typed, thereby further 
reducing computer processing power and time. 
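
The effect of these alternatives on the number of candidates can be illustrated as follows. The word list and function names are assumptions chosen for this sketch only.

```python
# Hypothetical toy dictionary, for illustration only.
WORDS = ["cat", "cap", "cot", "coat", "cart", "dog"]

def by_first(words, first):
    """Entries beginning with the typed first letter."""
    return [w for w in words if w.startswith(first)]

def by_first_two(words, prefix):
    """Entries beginning with the two typed letters."""
    return [w for w in words if w.startswith(prefix)]

def by_first_and_last(words, first, last):
    """Entries beginning and ending with the typed letters."""
    return [w for w in words if w.startswith(first) and w.endswith(last)]

print(by_first(WORDS, "c"))                # first letter only
print(by_first_two(WORDS, "ca"))           # first two letters
print(by_first_and_last(WORDS, "c", "t"))  # first and last letters
```

Each additional typed letter narrows the set of entries that must be compared with the spoken audio signal, at the cost of one more keystroke per word.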
CLAIMS

1. A method for facilitating the 
identification of a spoken audio signal in a voice 
recognition computer application, the method 
comprising the steps of:

a. receiving a spoken audio signal at an
input device of a computer;

b. receiving a typed first letter of the
spoken audio signal at the input device of the
computer;

c. after steps a. and b., searching entries
starting with the typed first letter in a dictionary
of the computer and selecting a best match; and

d. inserting the best match into a currently
running application of the computer.

2. The method of claim 1, wherein said 
steps a. and b. are performed substantially 
simultaneously.

3. The method of claim 1, wherein one of
said steps a. and b. is performed before the other
of said steps a. and b., and said computer interface
method further comprises the step of storing said
one of said steps a. and b. in a buffer register
until the other of said steps a. and b. is
performed.

4. The method of claim 1, further
comprising the steps of:

f. determining whether the spoken audio
signal comprises a predetermined function;



WO 99/00790 



PCT/US98/13118 



g. performing the predetermined function if
it is determined in step f. that the spoken audio
signal comprises the predetermined function; and

h. performing step c. if the spoken audio
interface is not the predetermined function.